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ABSTRACT 

Recent studies have shown that massive quiescent galaxies at high redshift are much more compact than 
present-day galaxies of the same mass. Here we compare the radial stellar density profiles and the number 
density of a sample of massive galaxies at z ~ 2.3 to nearby massive elliptical galaxies. We confirm that 
the average stellar densities of the z ~ 2.3 galaxies within the effective radius, p(< r e ), are two orders of 
magnitude higher than those of local elliptical galaxies of the same stellar mass. However, we also find that 
the densities measured within a constant physical radius of 1 kpc, p(< lkpc), are higher by a factor of 2-3 
only. This suggests that inside-out growth scenarios are plausible, in which the compact high redshift galaxies 
make up the centers of normal nearby ellipticals. The compact galaxies are common at high redshift, which 
enables us to further constrain their evolution by requiring that the number density of their descendants does 
not exceed constraints imposed by the z = galaxy mass function. We infer that size growth must be efficient, 
with (ri+i/l) ~ {Mm/Mi) 2 . A simple model where compact galaxies with masses ~ 10 n M Q primarily 
grow through minor mergers produces descendants with the approximate sizes, stellar densities, and number 
density of elliptical galaxies with masses 2 - 3 x 10 11 M Q in the local Universe. We note that this model also 
predicts evolution in the Mbh - & relation, such that the progenitors of elliptical galaxies have lower black 
hole masses at fixed velocity dispersion. The main observational uncertainty is the conversion from light to 
mass; measurements of kinematics are needed to calibrate the masses and stellar densities of the high redshift 
galaxies. 

Subject headings: cosmology: observations — galaxies: evolution — galaxies: formation — galaxies: elliptical 
and lenticular, cD 



1. INTRODUCTION 

Several recent studies have found that the oldest and most 
massive galaxies at high redshift have very small sizes (e.g., 
Trujillo et al. 2006; Daddi et al. 2005; Toft et al. 2007; Zirm 
et al. 2007; van Dokkum et al. 2008; Cimatti et al. 2008; van 
der Wei et al. 2008; Franx et al. 2008; Damjanov et al. 2008; 
Buitrago et al. 2008). Although these studies use different 
datasets and methodology they are in good agreement, find- 
ing that the effective radii of red, apparently quiescent galax- 
ies of fixed mass evolved by a factor of ~ 5 since z ~ 2.5 (e.g., 
van der Wei et al. 2008). Initially there were concerns about 
the quality of photometric redshifts, the depth of the imag- 
ing data, and the interpretation of the broad-band spectral en- 
ergy distributions (SEDs), but these were recently addressed 
through deep Gemini/GNIRS near-infrared spectroscopy and 
deep HST/NICMOS imaging of a sample of massive quies- 
cent galaxies at z ~ 2.3 (Kriek et al. 2006; van Dokkum et al. 
2008). 

These small galaxies are remarkable when compared to 
nearby galaxies, as their average stellar densities are a factor 
of > 100 higher than those of red SDSS galaxies of the same 
mass (van Dokkum et al. 2008). Such massive, dense galaxies 
are very rare in the local Universe (e.g., Trujillo et al. 2009) 
but they make up about half of galaxies with M > 10 11 M Q 
at z ~ 2.3 (e.g, van Dokkum et al. 2006; Kriek et al. 2006; 
Williams et al. 2008). 
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Various scenarios have been proposed to explain the ob- 
served properties of the compact galaxies and to describe their 
subsequent evolution. The most straightforward explanation 
is that the masses are overestimated and/or the sizes under- 
estimated. The mass measurements currently rely on fitting 
stellar population synthesis models to the observed photome- 
try and spectra, and these models have considerable system- 
atic uncertainties. A significant uncertainty is the stellar initial 
mass function (IMF): a "bottom-light" IMF, such as proposed 
by, e.g., van Dokkum (2008), Dave (2008), and Wilkins, Tren- 
tham, & Hopkins (2008), would generally lower the implied 
masses, with the precise effect depending on the age of the 
stellar populations. The sizes of the galaxies can be underesti- 
mated in several ways. It may be that the galaxies have strong 
radial gradients in M/L ratio, in which case the luminosity- 
weighted sizes are different from the mass-weighted sizes 
(e.g., Hopkins et al. 2008). We also note that Hopkins et al. 
(2008) predict smaller differences between nearby elliptical 
galaxies and their progenitors, due to contribution of the dark 
matter halos. Limitations in resolution and signal-to-noise ra- 
tio may also play a role, although this seems increasingly un- 
likely. 

Taking the measured masses and sizes at face value, three 
effects have been discussed to explain the observed evolution. 
The first is a variation on "progenitor bias" (van Dokkum & 
Franx 2001), which states that early-type galaxies at high red- 
shift are only a subset of all progenitors of today's early-type 
galaxies. As we discuss later, the number density of the com- 
pact galaxies at z ~ 2.3 is only ~ 7 % of the number density of 
galaxies with the same mass today (see § 5. 1) 4 . Therefore, the 

4 We note that this fraction is smaller than that found in Kriek et al. (2008). 
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compact galaxies may be the progenitors of the most compact 
<~ 7 % of today's galaxies with the same mass (see also Franx 
et al. 2008). This explanation cannot be complete, as the com- 
pact galaxies are small even when compared to this subset of 
the present-day population. The second explanation is minor 
or major merging, which will increase the sizes but also the 
masses (Khochfar & Silk 2006; Naab et al. 2007; Hopkins 
et al. 2008). Significant merging is expected for these mas- 
sive galaxies, e.g., White et al. (2007); Guo & White (2008), 
and merging scenarios have been discussed in several papers 
(e.g., Cimatti et al. 2008; van der Wei et al. 2008). The third 
explanation that has been discussed is expansion of the galax- 
ies as a result of dramatic mass loss due to quasar feedback 
(Fan et al. 2008). 

In this paper we provide new constraints on the evolution of 
compact "red and dead" high redshift galaxies. In § 3 we com- 
pare the radial stellar density profiles of the compact galaxies 
to those of nearby elliptical galaxies. This allows us to de- 
termine whether the compact galaxies resemble the central 
regions of elliptical galaxies, and hence whether normal el- 
liptical galaxies are plausible descendants via merging sce- 
narios. In § 4 we present three simple models to explain the 
growth of compact galaxies into local elliptical galaxies. In 
§ 5 we consider which of the modes is most likely to domi- 
nate galaxy growth by including constraints from the evolu- 
tion of the mass function, and derive a lower bound on the 
amount of size growth for a given amount of mass growth. 
Throughout this paper, we assume a ACDM cosmology with 
Hq = 70kms _1 Mpc _1 , Q m = 0.3, and Q\ = 0.7. 

2. DENSITY PROFILES 

Density profiles of nearby elliptical galaxies and the com- 
pact high redshift galaxies are constructed. For the compact 
galaxies we deproject the Sersic fits presented in vD08, and 
for the nearby galaxies we use a combination of new and lit- 
erature data. 

2. 1 . Surface Brightness Profiles 

2.1.1. High Redshift Galaxies 

We use the sample of nine high redshift "red and dead" 
galaxies previously studied by Kriek et al. (2006) and van 
Dokkum et al. (2008) [hereafter vD08]. The redshifts of the 
galaxies were measured from deep rest-frame optical Gem- 
ini/GNIRS spectra (Kriek et al. 2006). The spectra also 
demonstrate that the light comes from evolved stellar popu- 
lations, as they exhibit prominent Balmer or 4000 A breaks. 
The galaxies were imaged with the Hubble Space Telescope 
(HST) NICMOS2 camera, and with Keck/NIRC2 using laser 
guide star-assisted adaptive optics. As described in vD08, the 
galaxies were fit with Sersic (1968) profiles using GALFIT 
(Peng et al. 2002). Structural parameters for the galaxies are 
listed in vD08. 

Surface brightness profiles in the flieo band were con- 
structed from the Sersic fits. The galaxies are barely resolved 
even with the NICMOS2 camera, and we have essentially no 
information on the form of the density profile within the ef- 
fective radius (0."1, or w 1 kpc). The average density within 
this radius is much better constrained, and this is the param- 
eter that we will use in quantitative comparisons. We note 

The reason is that we adopt a different IMF and therefore a different mass 
limit, and we assume that at z = all galaxies with M > 10" Mq are "red 
and dead" but only ~ 50 % at z = 2.5. 



that GALFIT effectively extrapolates the Sersic fits to the (re- 
solved) structure at large radii inward while conserving the 
total flux, and that therefore the fits may also provide a good 
approximation of the form of the density profile within 1 kpc. 

2.1.2. Nearby Galaxies 

Two sources are used for the nearby sample. The Tal, van 
Dokkum, & Nelan (2009) [hereafter T09] sample is an abso- 
lute magnitude and volume-limited sample of local elliptical 
galaxies, selected from Tully (1988). All galaxies with mor- 
phological type "E", Mb < -20, within declinations of -85 
and +10, galactic latitude > 17° or < -17° and at distances 
of 15-50Mpc were observed with the Yale 1 .0 m telescope at 
CTIO, operated by the SMARTS consortium, in the V band. 
The observing strategy was optimized for flat-fielding accu- 
racy, and the surface brightness profiles can be reliably traced 
to w 29 mag arcsec -2 . The galaxies were fit with isophotal el- 
lipses using IRAF. Apparent magnitudes were calibrated us- 
ing aperture photometry of Prugniel & Heraudeau (1998) and 
then converted to B magnitudes using published B — V colors 
from the same catalogue (neglecting color gradients). Mea- 
surements were corrected for Galactic reddening using in- 
frared dust maps from Schlegel, Finkbeiner, & Davis (1998). 
We assume distance measurements from the Tully catalogue 
(corrected to our cosmology) to convert the luminosity pro- 
files to physical units. 

The T09 sample has the advantage that it is complete down 
to a luminosity limit (which roughly corresponds to a mass 
limit for these luminous red ellipticals), but the disadvantage 
is that it only samples a limited range in mass and luminos- 
ity. We supplemented the T09 sample with photometry from 
Franx, Illingworth, & Heckman (1989); Peletier et al. (1990); 
Jedrzejewski (1987) [hereafter FPJ]. This sample is not com- 
plete but covers a larger range in luminosity. We limited the 
sample to all galaxies that have published B-band profiles. 
Again distances from Tully (1988) were used to convert the 
observed brightnesses to luminosities. 

2.2. Deprojection 

The intensity profiles of the nearby galaxies are fit to Sersic 
profiles of the form 



/(r) = /„exp 
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with n < 4 between radii of 4" out to 20 kpc, or the maxi- 
mum extent of each profile, along the circularized axis, r = 
a\/{\ — e), of the galaxy (Ciotti 1991). b„ is defined as the 
solution to j(2n,b n ) = T(2n)/2. We use the asymptotic ap- 
proximation for b n , which is accurate to a factor of O ~ 10~ 6 : 

„ 1 4 4 6 
b n ^2n-- + — — + — — , (2) 
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(Ciotti & Bertin 1999). For the high redshift galaxies we used 
the fits of vD08. We then performed an Abel Transform to 
deproject a circularized, three-dimensional light profile: 

exp[-b n (0 l/n) t] 
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For both the high redshift sample of compact galaxies and 
the two samples of nearby elliptical galaxies we now have 
circularized radial luminosity density profiles in units of 
L B ,©kpc~ 3 . 
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2.3. Light-to-Mass Conversions 

In order to convert the luminosity density profiles to stel- 
lar mass density profiles we make the following assumptions 
about the mass-to-light (M/L) ratios. For the high redshift 
sample we use stellar masses from Kriek et al. (2008) adjusted 
to a Kroupa (2001) IMF. For the nearby sample, we use the 
well-established relation between M/L ratio and mass to con- 
vert luminosities to masses (e.g., van der Marel 1991). The 
normalization and slope of the relation in the B band were de- 
termined by combining the information in Table 1 of van der 
Marel (1991) and Table 2 in van der Marel & van Dokkum 
(2007): 



^ = (9.04xl0- 4 )(-^ 



0.37 



(4) 



The conversion from luminosity to mass is the largest un- 
certainty in the methodology, in particular the lack of dynam- 
ical measurements that could calibrate the M/L ratios of the 
high redshift galaxies. We will return to this issue in § 6. 

3. COMPARISON OF DENSITY PROFILES 
3.1. Average Profiles 

The stellar density profiles of the compact high redshift 
galaxies are compared to those of nearby elliptical galaxies 
in Fig. Q] The solid line is the average density profile of the 
9 galaxies from vD08. We use a 1000 iteration bootstrap es- 
timation to approximate errors of the average density profile 
due to the small sample size of the high redshift galaxies. The 
1 a contour is shown in dark gray and the 2 a is shown in 
light gray. Broken lines are average profiles of nearby ellipti- 
cal galaxies from the T09 sample, in three different mass bins. 
The lowest mass bin is M > 1O U M0: this is the mass that the 
high redshift galaxies already have at the epoch of observa- 
tion, and therefore the minimum mass of their descendants. 

Figure[T]shows that the discrepancy between the profiles of 
compact high redshift galaxies and nearby elliptical galaxies 
is mostly in the outer regions. Within r w lkpc the average 
stellar density of the high redshift galaxies is greater than the 
density of nearby ellipticals by a factor of a few only, partic- 
ularly for the more massive galaxies in the T09 sample. This 
discrepancy is much smaller than the factor of > 100 differ- 
ence when the density is measured within the effective radius 
(e.g., vD08). Furthermore, our error estimates only address 
the sample bias; this discrepancy is especially small consid- 
ering the other sources of uncertainty in our measurements, 
which we will discuss further in § 6. Outside of this inner re- 
gion, the difference grows significantly; the stellar density of 
nearby elliptical galaxies is a few hundred times higher than 
that of the compact high redshift galaxies at r > lOkpc. We 
infer that in order to evolve into nearby galaxies, compact 
galaxies need not change significantly in the central regions, 
but must grow significantly in their outer regions. 

3.2. Comparison of Masses, Sizes, and Densities 

The relative properties of the high redshift galaxies and 
nearby galaxies are demonstrated in Figure [2] The compact 
galaxies from vD08 are indicated by solid circles. The nearby 
samples are represented by open symbols: squares for T09 
and triangles for FPJ. Only nearby galaxies with sufficient 
mass, M > lO n M0, can be the descendants of the high red- 
shift galaxies. Galaxies with lower masses are denoted with 
light grey symbols. 
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FIG. 1 . — Comparison of the mean stellar density profiles of high redshift 
compact galaxies (solid line) with nearby elliptical galaxies from the T09 
sample (broken lines). High redshift 1 and 2 a contours are shown in gray. 
The average density profile for nearby galaxies with M > 10"Mq is repre- 
sented by the red, dashed line; the green, short dashed line corresponds to 
galaxies with M > 3 X 1O"M0; and the most massive local galaxies with 
M > 5 X 10' 'Mq are shown by the blue, long-dashed line. Note that the pro- 
files of the nearby galaxies are fairly similar to those of the compact galaxies 
at radii r < 3 kpc, qualitatively consistent with expectations for inside-out 
growth. 

The relative compactness of high redshift and low redshift 
galaxies is shown in Figure|2a). There is a clear trend show- 
ing the increasing effective radius with galaxy mass in the 
nearby galaxies. The high redshift galaxies, though in the 
middle of the nearby mass range, are smaller by a factor of 
~ 5 in effective radius. This result confirms previous studies, 
which generally used the Sloan Digital Sky Survey (SDSS) as 
a low redshift comparison point (e.g., Toft et al. 2007; van 
Dokkum et al. 2008; Cimatti et al. 2008; van der Wei et al. 
2008). van der Wei et al. (2008) combine data from the lit- 
erature (in addition to adding new data at z ~ 1) and derive 
an evolution of r e oc (l+z)" 1 ' 2 12 at fixed mass (for samples 
with photometrically determined masses), corresponding to a 
factor of 4.2 ±0.6 at z = 2.3. 

The difference in size at fixed mass implies a significant dif- 
ference in density contained within the effective radius of the 
high redshift and nearby galaxies. We calculated the average 
densities within the effective radius by integrating the stellar 
density profiles derived in the previous Section: 



p{< r) = 



p(r'y ' dr' 



(5) 



with r = r e . This difference is obvious in Figure |2b): the 
vertical axis of this panel demonstrates the factor of > 100 
differences in the average density within the effective radius. 

The horizontal axis of Fig. |2jb) shows the average density 
integrated to r = 1 kpc rather than r = r e . For convenience, 
we will refer to the average density within 1 kpc as the "cen- 
tral density". Since the compact galaxies have effective radii 
~ 1 kpc, the density within r e is approximately equivalent to 
our definition of the central density, placing these galaxies 
along the dashed diagonal line representing the equality of 
the two densities. The nearby sample lies predominantly be- 
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FIG. 2. — Relative properties of nearby and z ~ 2.3 galaxies. The panels show the relations between size and mass (a), density within the effective radius 
and density within 1 kpc (b), density within the effective radius and mass (c), and density within 1 kpc and mass (d). Open symbols are nearby galaxies, solid 
circles are high redshift compact galaxies from vD08. Light grey points are nearby galaxies with masses < 10" Mq, i.e., lower than the high z compact galaxies. 
Arrows begin at mean values of high redshift sample and show predictions from simple models for the evolution of the compact galaxies: blue arrows shows the 
direction of evolution due to equal-mass mergers, green arrows for minor mergers and red arrows for the expansion model. Simple expansion or minor mergers 
can bring the distant galaxies close to the scaling relations defined by nearby galaxies, but equal-mass mergers do not produce galaxies of the right size. 



low this line, with p(< r e ) much lower than p(< 1 kpc) for all 
galaxies with masses > 10 11 M Q . We infer that, although the 
high redshift galaxies have higher densities than nearby ellip- 
ticals overall, the differences are much smaller within 1 kpc 
than within 1 r e . 

Figure |2J C ) an d (d) demonstrate the same point in the den- 
sity versus mass plane. In (c), we show the relation between 
p(< r e ) and total stellar mass. The compact high redshift 
galaxies are clearly much denser than nearby galaxies of the 
same mass. In (d), it is shown that the discrepancy in density 
becomes far less extreme in the central regions of the galax- 
ies. The nearby sample shows opposite trends with mass in 
(c) and (d): the density within the effective radius decreases 
with increasing mass (reflecting the slope of the mass - radius 
relation), but the density within 1 kpc grows with increasing 



mass. Interestingly, the central densities of the high redshift 
compact galaxies are very similar to those of nearby elliptical 
galaxies with masses >5x 10 11 M Q . 

The trends in Fig. [2] are consistent with models in which 
the compact galaxies make up the centers of present-day gi- 
ant ellipticals. Such inside-out formation scenarios are not 
new, and have been explored by, e.g., Loeb & Peebles (2003), 
Bournaud, Jog, & Combes (2007), Naab et al. (2007), and 
Hopkins et al. (2008). The idea is that a compact core is 
formed through highly dissipative processes at z > 3 (see, e.g., 
Robertson et al. 2006a; Dekel et al. 2008), which then grows 
through increasingly dissipationless mergers at lower redshift. 
Independently, Franx et al. 2008 argues that galaxy growth is 
mostly inside-out, based both on the regular evolution of the 
stellar mass-radius relation, and on the fact that star forming 
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galaxies are larger than non-star forming galaxies of the same 
mass. 

4. PREDICTIONS FROM SIMPLE MODELS 

As discussed in § 1, various models have been proposed to 
explain the apparent growth of massive galaxies since z ~ 2.5. 
Here we discuss three possible simple models in the context 
of the relations shown in Fig. |2j equal-mass mergers, minor 
mergers and expansion at fixed mass. We investigate the ef- 
fects of these models in Fig. [2] with arrows. The starting point 
of the arrows is always the mean of the high redshift com- 
pact galaxies, and they all imply a growth in effective radius 
of a factor of 5. We emphasize that we look to constrain the 
dominant mode of galaxy evolution; while individual galax- 
ies in the sample will likely be affected by all of the processes 
discussed below, we focus on the overall trends in the larger 
context of the sample of galaxies. 

4.1. Model 1: Growth via Equal-Mass Mergers 

In this model, the growth is driven by (near-) equal mass 
mergers. These mergers will not only increase the size of the 
galaxies, but also their mass. Applying straightforward virial 
arguments implies 

K 1+2 =K,+K 2 , (6) 

with Ki+2 the kinetic energy of the remnant and K\, K 2 the 
kinetic energy of the progenitors. With K = \Mo 2 we have 



^M u2 a 2 +2 = ^Mia 2 + ^M 2 al, 



(7) 



(8) 



and as M l+2 = Mi +M 2 and M\ = M 2 , we have a 2 +2 = a\. Using 
a 2 oc GM/r, we arrive at 

ri+2 _ Mi+2 

n ~ Mj ' 

the familiar result that mergers lead to an increase in size and 
mass but no change in velocity dispersion (e.g., Barnes 1992). 
We note that these relations are simplifications, which are in- 
consistent with the observed slopes of the stellar mass - radius 
relation and the stellar mass - a relation. Simulations which 
take the initial orbits and effects of energy transfer to the dark 
matter halos into account generally imply a smaller increase 
in size for a given change in mass. Boylan-Kolchin, Ma, & 
Quataert (2006) find that r x+2 /n ~ (Mm/Mi) - 6 - 1 , depend- 
ing on the orbital configuration. 

The blue arrows in Fig. EJa-d) show the effects of equal- 
mass mergers on the various relations between mass, size, and 
density. The density within 1 kpc was calculated by assuming 
that the Sersic indices of the profiles of the compact galaxies 
do not change. The blue arrows imply that the descendants 
of the compact galaxies are the dominant galaxies in massive 
groups and clusters, with stellar masses of ~ 10 12 Mq. As can 
be seen in panel d, the central densities of these galaxies are 
nearly identical to those of the compact galaxies. However, 
as can be seen in panel a, the effective radii of these giant, 
nearby galaxies are a factor of ~ 10 larger than the compact 
objects, not a factor of ~ 5. Therefore, this model is not a 
very good description of the required evolution in panels a - 
c. 

4.2. Model 2: Growth via Minor Mergers 

In this mode of galaxy growth, the progenitor galaxies ac- 
cumulate mass via minor mergers with small systems. The 



difference with the equal-mass merger model is that minor 
mergers are more effective in "puffing up" the size a galaxy 
for a given change in stellar mass. For minor mergers o\ 3> a\ 
in Eq. [7] and therefore 

(9) 



'1+2 



Mi. 



Again using a 2 oc GMj r we have 
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The effective radius grows by the square of the change in 
mass (rather than linearly, which is the case for equal-mass 
mergers) and the velocity dispersion decreases by the square 
root of the change in mass (rather than remaining constant) 
(see also Naab et al. 2009). As an example, eight successive 
M 2 : Mi = 1 : 10 mergers could lead to a factor of ~ 5 increase 
in effective radius, while the mass would grow by a factor of 
~ 2 only. 

The effects of this scenario are shown by the green ar- 
rows in Fig. |2 Again, the density within 1 kpc was calcu- 
lated by assuming that the Sersic index of the profiles re- 
mains unchanged. The compact galaxies have a median mass 
of 1.7 x 10 11 M therefore the minor merger model predicts 
that their descendants are in galaxies with a median mass of 
3-4 x 10 u Mq today. The central densities of these galaxies 
are a very good match to those of the predicted descendants 
(panel d), and the effective radii are a much better match than 
in the equal-mass merger model (panel a). We note here that 
what matters is the direction of the arrows, as their length is 
arbitrarily determined by a growth of a factor of five in r e . Ex- 
tending the green arrows slightly would bring them very close 
to the distribution of nearby elliptical galaxies in all panels. 

4.3. Model 3: Expansion at Fixed Mass 

In the final model that we examine, a galaxy has accumu- 
lated most of its mass by z ~ 2 and then gradually expands 
over time while its mass stays roughly constant. The motiva- 
tion for this class of models was provided by Fan et al. (2008); 
they suggest that a QSO may blow out a large fraction of the 
mass, leading to a significant "puffing up" of the remnant. We 
will discuss whether such models are physically plausible in 
§ 5.2 (see also Trujillo et al. 2009). 

Predictions for these models are indicated by red arrows in 
Fig. [2] By construction, these models predict the right amount 
of size evolution at fixed mass, and therefore produce the 
same values of p(< r e ) as nearby elliptical galaxies. However, 
as can be seen in panels b and d they slightly under-predict the 
central densities of local ellipticals. The values of p(< 1 kpc) 
that are predicted are a factor of ~ 2 lower than those of local 
ellipticals with the same mass. 

4.4. Summary of Model Comparisons 

We conclude that all three simple models bring the com- 
pact galaxies much closer to the relations defined by nearby 
elliptical galaxies. The equal-mass merger model provides 
the worst description of the three as it does not produce ellip- 
tical galaxies of the right size and is therefore probably not the 
dominant mode of growth. The minor merger is more effec- 
tive in puffing up the compact galaxies and, despite its sim- 
plicity, provides a remarkably good description of the masses 
and densities of nearby elliptical galaxies. The expansion 
model provides a good description as well, although it slightly 
under-predicts the central densities of elliptical galaxies. 
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5. DISCUSSION 

The main result of the previous Section, and this paper, is 
that the properties of high redshift compact galaxies can be 
reconciled with those of nearby massive elliptical galaxies. 
The densities of the compact galaxies are similar to the central 
densities of elliptical galaxies, and simple "toy models" can 
be used to describe the evolution. 

5.1. Independent Constraints on Mass Growth 

The amount of mass growth in the models of § 4 is specified 
by the physical mechanism for growth (equal-mass merger, 
minor mergers, and expansion) and the choice of a factor of 
five increase in effective radius. To achieve this increase, the 
equal-mass merger model requires an increase in mass of a 
factor of ~ 5, the minor merger model requires an increase of 
a factor of 2-3, and the expansion model does not require an 
increase at all. 

The evolution of the galaxy mass function provides an in- 
dependent constraint on the mass growth. The compact galax- 
ies are common at the epoch of observation - they constitute 
> 90 % of "red and dead" galaxies at z = 2.3 (vD08) and there- 
fore some ~ 50 % of the general population of galaxies with 
stellar masses > 10 11 M (e.g., Kriek et al. 2006; Kriek et al. 
2008; Williams et al. 2008). The evolution of the galaxy 
mass function has been measured recently by several groups 
(e.g., Drory et al. 2005; Fontana et al. 2006; Marchesini et al. 
2008; Perez-Gonzalez et al. 2008) . Here, we use the results 
from Marchesini et al. (2008), who have combined data from 
both deep and wide surveys in a self-consistent way. Inte- 
grating the Schechter function fit given in Marchesini et al. 
(2008) we find that the integrated number density of galax- 
ies with stellar masses M > lO n M is 7.2+j;] x 10" 5 Mpc" 3 
at z = 2.5. The number density of compact, quiescent galax- 
ies is therefore 3.6^g x 10~ 5 Mpc" 3 , where we assumed that 
the quiescent fraction is 0.5 ±0.1. The stellar mass density in 
these galaxies is 4.8±JJ x 1O 6 M Mpc" 3 . 

This number density and mass density provide strict bounds 
on the typical masses of the descendants of the compact galax- 
ies. If the mass density of the compact galaxies exceeds that 
of local galaxies of a particular mass it is immediately clear 
that these local galaxies cannot constitute the (sole) descen- 
dants. Figure [3] shows the integrated Schechter stellar mass 
function in the local universe in dark grey, as well as the num- 
ber density of compact galaxies with M > 10 1 1 Mq at z = 2.5 in 
light grey. For any descendant population at z = 0. 1 , mass cor- 
responds to a required growth factor, given on the lower axis. 
We first ignore mergers of compact galaxies with themselves 
and address that possibility later. In order for a model to be a 
feasible evolutionary path the implied descendant population 
of galaxies in the local universe must be at least as common 
as the progenitors at high redshift. 

The number density of nearby massive galaxies limits the 
mass growth to a factor of 2-3. For this mass growth each 
compact galaxy has one descendant. Lower mass growth im- 
plies that only a small fraction of massive galaxies today hosts 
a descendant of a compact galaxy. Higher mass growth is not 
allowed, as it would create too many descendants. Of course, 
these constraints are dependent on the masses at z = 2.5, which 
we derived. If these masses are incorrect, this argument might 
change. 

Vertical lines in Fig. [3] indicate predictions from the three 
models discussed in § 4. The expansion model is obviously 
fully consistent with the constraints imposed by the evolution 
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FIG. 3. — Integrated number density of galaxies above a mass limit. The 
horizontal line is the number density of quiescent galaxies with M > 10 1 1 Mq 
at z = 2.5. The diagonal relation is the number density at low redshift as a 
function of the mass limit. The mass limit is indicated in absolute units on 
the top axis, and as a growth factor compared to z = 2.5 on the bottom axis. 
Vertical lines indicate the growth implied by the simple models discussed in 
§ 4. Ignoring merging of compact galaxies with themselves, the mass func- 
tions at z = 2.5 and z, ~ do not allow for growth of more than a factor of 
2-3. Strong merging of compact galaxies is ruled out by the integrated mass 
density at low redshift (see text). 

of the mass function, as it implies no mass growth. We note 
that only ~ 7 % of nearby galaxies with masses > 10 11 M Q 
are descendants of quiescent z = 2.5 galaxies in this model; 
we will return to this point below. Remarkably, we can rule 
out the equal-mass merger model as the main mode of growth 
based on Fig. [3] as it implies a mass growth of a factor of 
~ 5. The number density of nearby galaxies with M > 5 x 
10 11 M Q is lower by more than an order of magnitude than 
the number density of compact galaxies with M > 10 11 M 
at z = 2.5. In the equal-mass merger model, compact galaxies 
can obviously merge with each other, which will lower their 
number density. However, a factor of ~ 5 mass growth is 
not allowed even when compact galaxies are only permitted 
to merge with each other: the stellar mass density in galaxies 
with M > 5 x 10 11 M Q at z = 0.1 is 8.1^ x 10 5 M Q Mpc" 3 , 
a factor of 6 lower than the mass density in compact galaxies 
withM> 1O U M atz = 2.5. 

Also remarkably, the growth in the minor merger model is 
close to the cross-over point, where each compact galaxy has 
one descendant. A plausible explanation is that the central 
parts of many elliptical galaxies formed at z > 2.5, after which 
they grew through minor, mostly dry mergers. 

More generally, we can combine panel a of Fig. [2] with 
Fig. |3] to derive an empirical constraint on the amount of size 
growth for a given amount of mass growth. Parameterizing 
the relation between size growth and mass growth as 



n+2 



M 



1+2 



(11) 



we find that a > 2 to simultaneously satisfy the constraints 
from the evolution of the size - mass relation (Fig. [2ja; van 
der Wei et al. 2008), and from the evolution of the mass func- 
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tion. This limit for a is similar to naive expectations from mi- 
nor mergers, which is why we obtain a good correspondence 
between progenitors and descendants for this class of mod- 
els. The equal-mass merger model has a ~ 1 (or even < 1; 
see Boylan-Kolchin et al. 2006); for the expansion model 
Mi j /Mi = 1 (or even < 1 ) and Eq.QT|is n °t well defined. 

5.2. Which Models are Physically Plausible? 

We expect that each of these toy models is responsible at 
some level for the growth and evolution of galaxies from 
Z ~ 2.5 until today. Observational evidence of merging events, 
both equal-mass and minor, exists at intermediate redshifts 
and can definitely produce growth in galaxy mass and size. 
Mass loss from the central regions of galaxies should also 
occur and would therefore cause increases in galaxy sizes. 
Given this complexity, we hope to identify which of our sim- 
ple models best describes the mechanism responsible for the 
majority of the growth of the compact galaxies at high red- 
shifts into descendant galaxies in the nearby universe. 

In § 5.1 we found that the equal-mass merger model is in- 
consistent with the number density of massive galaxies today. 
We are therefore left with two feasible models, growth via "in- 
situ" expansion or via minor mergers. Both of these modes of 
galaxy growth have the effect of puffing up the galaxies with- 
out extreme mass growth. Number densities of the implied 
descendants of galaxies that have grown via either mode cor- 
respond to sufficiently common galaxies in the local universe. 

Although number density arguments do not immediately 
discredit the expansion model of galaxy evolution, they do 
lead to uncomfortable questions. The implication of no mass 
growth is that only a very small number of nearby galaxies 
with mass > 10 11 M was already formed at z = 2.5: approxi- 
mately 7 % if only quiescent galaxies at z = 2.5 are considered, 
and ~ 14 % if all galaxies with M > 10 11 M Q are considered. 
This raises the question where the progenitors of the remain- 
ing ~ 90 % of today's massive galaxies are at z = 2.5. In a hi- 
erarchical growth scenario, one expects that the most massive 
galaxies today have always been the most massive galaxies. 
Instead, the expansion model implies that the most massive 
galaxies at z ~ 2.5 evolve into a small fraction of average- 
mass elliptical galaxies today. Furthermore, the most massive 
galaxies in the local universe, with masses M > 3 x lO n M 
must then have formed rapidly in the later universe, imply- 
ing an extremely active merging history of smaller objects. 
One might conclude that they formed through star formation 
at lower redshift, but this would be inconsistent with the stel- 
lar ages of massive ellipticals (e.g., Thomas et al. 2005; van 
Dokkum 2008). 

There are other potential problems with the physical model 
proposed by Fan et al. (2008). The growth relies on strong 
heating of the inner regions of the galaxy, such as that pro- 
duced by a central active galactic nucleus (AGN). However, 
the high redshift galaxies in our sample are already shown to 
be quiescent, with old stellar populations. If there was an ac- 
tive central engine at one point in the galaxies' histories, it 
would have already blown out gas and led to expansion of the 
galaxy. While growth through mass loss may have played a 
role in the evolution of such galaxies, it is unlikely to do so 
again between z = 2.5 and z = 0, except possibly through stel- 
lar winds and supernovae. Based on simulations of open clus- 
ters, Fan et al. (2008) argue that there could be a long delay 
between the expulsion of gas and the response of the stellar 
distribution to the new potential, but it is not clear whether 
these simulations can easily be applied to massive galaxies. 
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FIG. 4. — Implied velocity dispersions of high and low redshift galaxies, 
along with approximate predictions from the three models discussed in § 4. 



Finally, the expansion model requires significant fine-tuning 
of the amount of mass that is removed from the galaxies: re- 
moving a small fraction of the mass does not have an appre- 
ciable effect, and removing too much would destroy the galax- 
ies. 

Minor mergers (or rather, "un-equal mass mergers") are ex- 
pected in galaxy formation models, and are predicted to dom- 
inate the mass growth of massive galaxies at late times (e.g., 
Naab et al. 2007; Guo & White 2008). Simulations have 
shown that the central regions of a galaxy can be minimally 
affected by dry mergers but that an envelope of newly ac- 
creted material is formed that grows with time (Naab et al. 

2007) . They have also been observed (e.g., Schweizer & 
Seitzer 1992). van Dokkum (2005) infers that visible tidal 
features around nearby elliptical galaxies are caused by red 
mergers with median mass ratio 1:4. It is an open question 
whether the merger rate is sufficiently high to produce a factor 
of 2-3 growth in mass since z = 2.5. Models do predict high 
accretion rates (e.g., De Lucia et al. 2006; Naab et al. 2007; 
Guo & White 2008), but some observations suggest that mass 
growth may be small for the highest masses (e.g., Cool et al. 

2008) . 

Minor merger models are also qualitatively consistent with 
the uniform and gradual evolution of the size - mass relation 
(e.g., Franx et al. 2008; van der Wei et al. 2008), and the 
apparent lack of old massive compact galaxies in the local 
Universe (e.g., Trujillo et al. 2009). If equal-mass mergers 
were a dominant mechanism one might expect to find some 
galaxies that did not experience a major merger and are there- 
fore left intact at the present day, but this is very unlikely in a 
minor merger model. 

5.3. Implied Velocity Dispersions 

As has been pointed out in several studies, the small sizes 
and high masses of the compact red galaxies imply very high 
velocity dispersions (Toft et al. 2007; Cimatti et al. 2008; 
van Dokkum et al. 2008). Figure |4] demonstrates the implied 
dynamical properties of the nearby and high redshift galaxy 
population as well as the possible evolutionary tracks of these 
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galaxies. 

We calculated the velocity dispersions from the equation 
given in van Dokkum & Stanford (2003): 

logM = 21ogcr + logr c + 6.07, (12) 

with r e in kpc and M in Solar masses. This expression is not 
very accurate as it does not take the relation between M and 
M/L or the effects of a dark halo into account, but it does 
allow a comparison in a self-consistent way. 

Predictions from the simple models of §4 are shown by 
arrows. The expansion model predicts that the dispersions 
decrease over time, as the total mass of the galaxy remains 
constant and the effective radius increases. As discussed in 
§4.1 the equal-mass merger model predicts that the velocity 
dispersions remain constant as the mass grows, which implies 
that the descendants have velocity dispersions that are higher 
than are implied by the galaxies in the local sample. Growth 
by minor mergers presents a possible method of decreasing 
the velocity dispersion of the galaxies, as the expansion is 
a stronger factor than mass growth. This mechanism, again 
shown by the green arrow on Figure|4ja), evolves the compact 
galaxies onto the velocity dispersion trend for local galaxies. 
This assumes mass growth by a factor of 2.1, based on the 
same specific minor merging history described in § 4.2, and 
size growth by a factor of 5. 

6. SUMMARY AND CONCLUSIONS 

The main result from our study is that nearby elliptical 
galaxies have similar average densities within 1 kpc as the re- 
cently discovered compact "red and dead" galaxies at high 
redshift. The descendants of the compact "red and dead" 
galaxies at z>2 could therefore simply constitute the central 
parts of today's massive elliptical galaxies. 

Models dominated by minor mergers (where "minor" im- 
plies "not equal mass") can increase the sizes of the galaxies 
efficiently, without violating constraints from the evolution of 
the evolution of the mass function as measured by Marchesini 
et al. (2008). Interestingly, the evolution of the mass - size 
relation and the mass function together imply that ~ 50% 5 of 
elliptical galaxies with mass > 2-3 x 10 11 M may have the 
remnant of a compact z = 2.5 galaxy with mass > 1 x 10 11 M 
in its center. Models which require energy input by a central 
engine to "puff up" the galaxies can also adequately evolve 
compact galaxies into sufficiently common local counterparts, 
but these models require significant fine-tuning and may not 
be physically plausible as the primary growth mechanism. We 
note that we did not consider star formation as a way to grow 
the compact z ~ 2.3 galaxies. Although it is possible that star 
formation re-starts at lower redshifts, newly formed stars can 
only account for a small fraction of the final mass given the 
stellar ages inferred for massive (>2x IO'IMq) galaxies at 
z = (e.g., Thomas et al 2005, van Dokkum & van der Marel 
2007). Nevertheless, a small amount of star formation could 
help increase the sizes between z ~ 2.3 and z = (see also 
Franx et al. 2008). 

The minor merger model predicts evolution in the Magor- 
rian et al. (1998) relation between black hole mass and veloc- 
ity dispersion. The central black hole will grow from z = 2.3 
to the present, with the amount of growth determined by the 
black hole masses of the infalling galaxies. However, the ve- 



locity dispersion will decrease by a factor of ~ 1.5. There- 
fore, even if the black hole growth is insignificant, black hole 
masses at fixed velocity dispersion will be significantly lower 
at z ~ 2.5 than at z = 0. Robertson et al. (2006b) came to the 
same conclusion using merger simulations, but this prediction 
contrasts with several other studies (e.g., Cen 2007; Woo et al. 
2008). 

The galaxy growth models that we describe here are simple 
and the empirical findings are preliminary, highlighting the 
need for further study. On the modeling side, the main un- 
certainties are whether the merger rate is sufficiently high to 
produce the required growth, and whether a realistic treatment 
of the dark matter and orbital configurations retains the high 
efficiency of minor mergers to "puff up" a galaxy. Whatever 
the dominant physical mechanism turns out to be, we find that 
a > 2 if the relation between size growth and mass growth is 
parameterized as r\+i/r\ = (M\+2/M\) a . 

Inside-out formation via mergers predicts that stars in the 
central regions of a nearby elliptical galaxy are qualitatively 
different from stars at larger radii. Elliptical galaxies do have 
color- and metallicity gradients, which could reflect differ- 
ences in stellar populations between stars formed in-situ and 
those accreted from other systems (e.g., Peletier et al. 1990). 
While it is not yet clear whether these gradients are consistent 
with such accretion scenarios, it may be difficult to recon- 
cile them with an expansion model alone (see, e.g., Pipino & 
Matteucci 2008). It is tempting to identify kinematically de- 
coupled cores (e.g., Franx & Illingworth 1988; Bender 1988) 
with the descendants of the compact galaxies, but the scales of 
these features are typically a few 100 pc rather than ~ 1 kpc. 
More information on color gradients and the inner ~ 1 kpc 
of the compact high redshift galaxies will provide important 
additional constraints. 

Our determinations of stellar density profiles and masses 
can be improved. The calculated density profiles and inte- 
grated masses are based on Sersic profile fits to the galaxy 
light distributions, not on the actual light profiles themselves. 
Furthermore, for the high redshift galaxies the profiles within 
~ 1 kpc are extrapolations, as the galaxies are not resolved 
on smaller scales. The conversion from light to mass is also 
very uncertain. The conversion for the local samples ignores 
scatter in the M/L versus L relation, and ignores gradients in 
M/L ratio. The mass estimates of the high redshift galaxies 
are based on stellar population models and are very sensitive 
to the assumed IMF and to possible contributions from dark 
matter. As noted in § 1 , bottom-light IMFs would change the 
masses and alter the required amount of size- and mass evo- 
lution to bring the galaxies to local relations. 

Measurements of absorption-line kinematics of high red- 
shift compact galaxies would provide a direct test of the IMF, 
and of several of the other assumptions that enter the analysis 
(see, e.g., Cimatti et al. 2008). van der Wei et al. (2008) find 
that the observed size evolution at < z < 1 is similar when 
dynamical masses rather than photometric masses are used, 
but this needs to be verified at higher redshifts. 
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5 In a minor merger model the exact fraction could range from ~ 10% - 
-100%, depending on the order and mass ratio of mergers. 
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